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Abstract 

The theme of this paper is the derivation of analytic formulae for certain large 
combinatorial structures. The formulae are obtained via fluid limits of pure jump 
type Markov processes, established under simple conditions on the Laplace trans- 
forms of their Levy kernels. Furthermore, a related Gaussian approximation allows 
us to describe the randomness which may persist in the limit when certain parame- 
ters take critical values. Our method is quite general, but is applied here to vertex 
identifiability in random hypergraphs. A vertex v is identifiable in n steps if there is 
a hyperedge containing v all of whose other vertices are identifiable in fewer steps. 
We say that a hyperedge is identifiable if every one of its vertices is identifiable. Our 
analytic formulae describe the asymptotics of the number of identifiable vertices and 
the number of identifiable hyperedges for a Poisson(/3) random hypergraph A on a 
set V of N vertices, in the limit as iV -> oo. Here (3 is a formal power series with 
non-negative coefficients @q,(3i, ■ ■ ., and (A(A))acv are independent Poisson random 
variables such that A.(A), the number of hyperedges on A, has mean Nf3j/{ N -') when- 
ever \A\ = j. 

Keywords hypergraph, component, cluster, Markov process, random graph 
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1 Introduction 



1.1 Motivation 



We are interested in the evolution of certain statistically symmetric random structures, 
extended over a large finite set of points, when points are progressively removed in a way 
which depends on the structure. The initial condition of the structure may allow few 
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possibilities for the removal of points, indeed it may be that, once a small proportion of 
points are removed, the process terminates. On the other hand, the removal of points may 
cause the structure to ripen, eventually yielding a large proportion of the initial points. 
Our analysis will enable us to demonstrate a sharp transition between these two sorts of 
behaviour as certain parameters pass through critical values. 

Let us illustrate this phenomenon by a simple special case. Consider the complete 
graph on N vertices and declare each vertex to be open with probability p, each edge to be 
open with probability a/N. Suppose that we are allowed to select an open vertex, remove 
it, and declare open any other vertices sharing an open edge with the selected vertex. If 
we continue in this way until no open vertices remain, we eventually remove every vertex 
connected to an open vertex by open edges. We shall see that the proportion of vertices 
thus removed converges in probability as iV — > oo and that the limit z*(p, a) is the unique 
root in [0, 1) of the equation 

az + log(l — z) — log(l — p). 

Thus, for small values of p, there is a dramatic change in behaviour as a passes through 1. 
As p I 0, for a < 1, 

z*(p,a)/p 1/(1 - a) 

but for a > 1 

z*(0+,a) > 0. 

Of course this is a reflection of well known connectivity properties of random graphs, 
discovered by Erdos and Renyi [8], and discussed, for example, in 

The class of models considered in this paper is a natural generalization of some classical 
models of random graphs and hypergraphs, which may be further motivated as follows. 
Phase transitions in combinatorial problems constitute an area of active research among 
computer scientists. Many "hard" combinatorial problems can be cast as satisfiability 
problems, which seek to assign a truth value to each of a set of Boolean variables, such 
that a collection of logical conjunctions are simultaneously satisfied. Phase transitions 
for random satisfiability ( "random k-SAT" ) problems have been studied by researchers at 
Microsoft PP, PH] and IBM but difficult questions remain unanswered. The random 
hypergraph model herein may be viewed as a simplification of the random satisfiability 
model: a vertex corresponds to a Boolean variable, and a hyperedge to the set of variables 
appearing in a specific logical conjunction, neglecting the truth or falsehood assigned to 
those variables. Under this simplification, definitive critical parameters are obtained which 
shed light on the random satisfiability model, and whose derivation may serve as a template 
for analysis of mixed satisfiability problems. 
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1.2 Hypergraphs 

Let V be a finite set of N vertices. By a hypergraph on V we mean any map 

A : V{V) -> Z+. 

Here Z + denotes the set of non-negative integers. The reader may consult [7] for an overview 
of the theory of hypergraphs: however the direction pursued here is largely independent of 
previous work. We emphasise that, in distinction to much of the combinatorial literature 
on hypergraphs, we allow the possibility that more than one edge is assigned to a given 
subset, thus we are considering multi- hypergraphs. Moreover we do not insist that all 
hyperedges have the same number number of vertices. Much of the literature is restricted 
to this uniform case. Our methods allow a significant broadening of the class of models for 
which asymptotic computations are feasible. Hyperedges over vertices are called patches 
(loops in [7j) and hyperedges over are called debris. The total number of hyperedges is 

|A| = J>(A). 

A 

1.3 Accessibility and Identifiability 

Interest in large random graphs has often focused on the sizes of their connected compo- 
nents. If there is given also, as in the example above, a set of distinguished vertices Vq, 
then it is natural to seek to determine the proportion of all vertices connected to Vq. 

In the more general context of hypergraphs there is more than one interesting counter- 
part of connectivity. Given a hypergraph A on a set V, we say that a vertex v is accessible 
in 1 step or, equivalently, identifiable in 1 step if A({v }) > 1. We say, for n — 2,3, ... , that 
a vertex is accessible in n steps if it belongs to some subset A with A (A) > 1, some other 
element of which is accessible in less than n steps. A vertex is accessible if it is accessible 
in n steps for some n > 1. 

On the other hand, we say that a vertex is identifiable in n steps if it belongs to some 
subset A with A (A) > 1, all of whose other elements are identifiable in less than n steps. 
A vertex is identifiable if it is identifiable in n steps for some n > 1. 

The notion of accessibility may be appropriate to some physical models similar to 
percolation, whereas identifiability is more relevant to knowledge-based structures. We 
shall examine only the notion of identifiability. 

Given a hypergraph A without patches and a distinguished vertex Vq, we say that a 
vertex v is accessible from vq if it is accessible in the hypergraph A + 1{{u }}> that is, in 
the hypergraph obtained from A by adding a single patch at vq. Identifiability from Vq is 
defined similarly. The set of vertices accessible from v is the component of v , as studied 
in HH H21 E]. The set of vertices identifiable from v is the domain of v , as studied 
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by Levin and the current authors [5:. We shall not consider further in this paper these 
vertex-based notions. 

The process of identification is dual to the process leading to the 2-core of a graph 
or hypergraph, that is to say, the maximal subgraph in which every non-isolated vertex 
has degree at least 2. In the former process one removes vertices having a 1-hyperedge, 
in the latter one removes edges containing a vertex of degree one. In this duality, non- 
identifiable vertices correspond to the 2-core. Thus our results may be interpreted as 
giving the asymptotic size of the 2-core for a certain class of random hypergraphs. 

1.4 Hypergraph Collapse 

It will be helpful to think of the identification of vertices as a progressive activity. Once 
a vertex is identified, it is removed or deleted from the vertex set, in a manner which is 
explained below. Thus, we shall consider an evolution of hypergraphs by the removal of 
vertices over which there is a patch. A hypergraph with no patches will therefore be stable. 
Given a hypergraph A and a vertex v, we can arrive at a new hypergraph A' by removing 
v from each of the hyperedges of A. Thus 



For example, in Figure ^ the patch on the central vertex is selected, and that vertex is 
removed; this causes a triangular face to collapse to an edge, and two edges incident to the 
vertex to collapse to patches on the vertices at the other ends. Note that this leaves two 
patches on the lower left vertex. 

If A({t>}) > 1 then we say that A' is obtained from A by a (permitted) collapse. Starting 
from A, we can obtain, by a finite sequence of collapses, a stable hypergraph Aqo. Denote 
by V* the set of vertices removed in passing from A to A^. The elements of V* are the 
identifiable vertices. We write A* for the identifiable hypergraph, given by 



We note that V*, and hence A* and Aoo, do not depend on the particular sequence of 
collapses chosen. For, if vi,v 2 , ■ ■ ■ and v[,v 2 , . . . are two such sequences, and if v n ^ v' k for 
all k, then we can take n minimal and find k such that {v\, . . . , f n -i} Q {v[, ■ ■ ■ , v k }; then, 
with an obvious notation, A' k ({v n }) > A n _i({t> n }) > 1, so v n must, after all, appear in the 
terminating sequence v[,v' 2 , . . . , a contradiction. We note also that V* increases with A. 




A*(A) = A(A)l A cv*. 
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BEFORE DELETION AFTER DELETION 



Figure 1: EXAMPLE OF A PERMITTED COLLAPSE - DELETION OF ONE VERTEX 



1.5 Purpose of This Paper 

The main question we shall address is to determine the asymptotic sizes of V* and A* 
for certain generic random hypergraphs, as the number of vertices becomes large. We 
note that, since the number of hyperedges is conserved in each collapse, all the identifiable 
hyperedges eventually turn to debris: 

Aoo(0) = |A*|. 

Note that V* depends only on min{A, 1}. In the case where A(A) = for \A\ > 3, 
the hypergraph min{A, 1} may be considered as a graph on V equipped with a number of 
distinguished vertices. Then V* is precisely the set of vertices connected in the graph to 
one of these distinguished vertices. 

1.6 Poisson Random Hypergraphs 

Let (Q, J 7 , P) be a probability space. A random hypergraph on V is a measurable map 

A : Q x V{V) -> Z+. 

An introduction to random hypergraphs may be found in [H], though we shall pursue 
rather different questions here. We shall consider a class of random hypergraphs whose 
distribution is determined by a sequence (3 = (/3j : j G Z + ) of non-negative parameters. 
Say that a random hypergraph A on V is Poisson{(3) if 

• The random variables A(A),A C V, are independent, 
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• The distribution of A (A) depends only on \A\, 

• J2\ A \=j A (A) ~ Poisson(iV^), j = 0, 1, . . . , N. 

A consequence of these assumptions is that A(A) has mean N/3j/( •) whenever \A\ = j. 
Note that, when N is large, for j > 2, only a small fraction of the subsets of size j have any 
hyperedges, and those that do usually have just 1. Also the ratio of j-edges to vertices tends 
to (3j. Our assumption of Poisson distributions is a convenient exact framework reflecting 
behaviour which holds asymptotically as N — > oo under more generic conditions. 

1.7 Generating Function 

A key role is played by the power series 

0(*) = £/^ (1) 

and by the derived series 

p\t) = Yttfji?- 1 , = £i0' - l )h tJ ~ 2 - 

j>i i>2 

Let (3 have radius of convergence R. The function (3'{t) + log(l — t) may have zeros in [0, 1) 
but these can accumulate only at 1. Set 

z* = inf{t G [0, 1) : (3\t) + log(l - t) < 0} A 1 (2) 

and denote by ( the set of zeros of f3'(t) +log(l — £) in [0, z*). Note that if (3 is a polynomial, 
or indeed if R > 1, then z* < 1. Also, the generic and simplest case is where ( is empty. 

2 Results 

We state our principal result first in the generic case. 
2.1 Hypergraph Collapse - Generic Case 

Theorem 2.1. Assume that z* < 1 and ( = 0. For N e N, fe* 6e a set of N 

vertices and let A N be a Poisson{(5) hypergraph on V . Then, as N — > oo ; the numbers of 
identifiable vertices and identifiable hyperedges satisfy the following limits in probability : 

\V N *\/N -> z\ \A m \jN -> /3(^*) - (1 - z*) log(l - 2*). 
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Example 2.1. The random graph with distinguished vertices, described in the introduction 
corresponds to a Poisson(/3 7V ) hypergraph A^, where 

1 _ e -/3f =P) i _ e -*P?/(N-i) = a/N 

and (3f = for j > 3. Note that ffi = (3 X and $ -> (3 2 as N -> oo, where A = - log(l -p) 
and = a/2. Theorem 12. II extends easily to cases where /3 depends on N in such a mild 
way: one just has to check that Lemma 16.11 remains valid and note that this is the only 
place that j3 enters the calculations. We have (3{t) = — t log(l — p) + t 2 a/2 so 

tf(t) + log(l -t) = - log(l - p) + ta + log(l - t). 

Then z* is the unique t G [0, 1) such that 

at + log(l-t) = log(l-p) 

and ( is empty, so \V* N \/N — > z* in probability as — >• oo, as stated above. 

Example 2.2. To illustrate critical phenomena, let (5{t) = a(0.1+0.9t) 7 . Let x, y, and z refer 
to the re-scaled number of vertices eliminated, the number of patches, and the amount of 
debris, respectively; here "re-scaled" means after division by the number of vertices. Plots 
of y and z versus x are shown in Figure |2l for the choices a = 1185 (solid) and a = 1200 
(dashed). In the case a = 1185, y hits zero when x ~ 0.02 , and so z remains stuck at 
about 0.02. A very small increase in a, from 1185 to 1200, causes a dramatic change in the 
outcome: after narrowly avoiding extinction (Figure |2J), the number of patches explodes 
(Figure El) as x increases towards 1. 

Consider what the Figures tell us about the supercritical case a = 1200: during the 
first 4% of patch selections, there is rarely any other patch covering the same vertex as 
the one selected; Figure El shows that, during the last 10% of patch selections, an average 
of 5792 other patches cover the same vertex as the one selected. [Read the labels on the 
x-axes carefully: Figure El is a close-up of the left-most 4% of the scale of Figure El] 

2.2 Hypergraph Collapse - General Case 

In order to describe an extension of Theorem 12.11 to the case where ( is non-empty, we 
introduce the random variable 

Z = min{z E ( : W (z/(l - z)) <0}Az* 

where {W t )t>o is a Brownian motion. 
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Figure 2: CHANGES IN BEHAVIOR NEAR A CRITICAL PARAMETER VALUE 

Theorem 2.2. Assume that R (. Then, for V N * and A N * as in Theorem \2. 11 the 
following limits exist in distribution: 

\V N *\/N -> Z, \A N *\/N -> /3(Z) ~{1-Z) log(l - Z). 

In the case where ( has only a single point < z*, then Z is equal to Co with probability 
\ and equal to z* with probability |. We do not know what happens when R e £. Proofs 
will be given in Section 

3 Randomized Collapse 

We introduce here a particular random rule for choosing the sequence of moves by which 
a hypergraph is collapsed, which has the desirable feature that certain key statistics of the 
evolving hypergraph behave as Markov chains. It is by analysis of the asymptotics of these 
Markov chains as N — > oo that we are able to prove our main results. 

3.1 Induced Hypergraph 

Let A be a Poisson(/9) hypergraph. For S C V with \S\ — n, let A s be the hypergraph 
obtained from A by removing all vertices in S. Thus, for A C V\S with \A\ = j, 

BDA, B\S=A 
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0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



Figure 3: PATCHES AND DEBRIS IN THE SUPERCRITICAL REGIME 



where the Poisson parameter Xj(N, n) is computed as follows: there are ( n ) ways to choose 
S f] B such that \B\ = i + j, and the Poisson parameter of A (B) is N(3j +i / (^-), so 



^») = JV E^(i)/( i + J -)- 

Moreover the random variables A S (A), A C V\S, are independent. 



3.2 Rule for Randomized Collapse 

Recall that the sequence of vertices chosen to collapse a hypergraph is unimportant, pro- 
vided we keep going until there are no more patches. However we shall use a specific 
randomized rule which turns out to admit a description in terms of a finite-dimensional 
Markov chain. This leads to a randomized process of collapsing hypergraphs (A n ) n > . This 
will prove to be an effective means to compute the numbers of identifiable vertices and 
identifiable hyperedges for A . 

The process (A n ) n > , together with a sequence of sets (S n ) n >o such that A n = A Sn , 
is constructed as follows. Let Sq = and A = A. Suppose that S n and A n have been 
defined. If there are no patches in A n , then S n+ x = S n and A„ +1 = A n . If there are patches 
in A n , select one uniformly at random and denote by v n+ \ the corresponding vertex; then 
set S n+l = S n U {v n+1 } and A n+i = A 5 " +1 . 
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3.3 An embedded Markov chain 

Let Y n denote the number of patches and Z n the amount of debris in A n . Then Y n = and 
Z n = | A* | for n > \V*\. Also |V*| = inf{n > : Y n = 0}. Let W n+1 denote the number 
of other patches at time n sharing the same vertex as the (n + l)st selected patch, and 
let U n+ i denote the number of 2-edges at time n containing the (n + l)st selected vertex 
v n+ i. Our analysis will rest on the observation that (Y n , Z n ) n > is a Markov chain, where, 
conditional on Y n = m > 1 and Z n = k, we have 

Y n +i = Y n — 1 — W n+ \ + f/ n+ i, Z n+ i = Z n + 1 + W n+ \ 

and where W n+1 ~ B (m - l,l/(iV-n)) and £7 n+ i ~ P ((N - n - l)\ 2 (N,n)) with W n +i 
and t/ n+ i independent. 

To see this, introduce the filtration 

T n = cr(S r , Y r , Z r : r = 0, 1, . . . , n). 

Lemma 3.1. Lei 

j9(A|5,A;,m) = P[A S = A|^A 5 ({w}) =m,A s (0) = fe]. 

T/ien 

P[A n = A|^ n ]=p(A|5 n ,y n ,Z„). 
Equivalently, for all B G T n so that B C {S n = S,Y n = m, Z n = k}, 

P[A n = A, B] = p(X\S, m, k)¥[B\. 

The claimed Markov structure for (Y n , Z n ) n > Q follows easily. 

Proof. The identity is obvious for n = 0. Suppose it holds for n. Let Be. {S n = S,Y n = 
m, Z n = k}. Take x G V\S, m! > 1 and fc' > fc. Set S' = S U {x} and £?' = {^n+i = 
S', Y n+ i = m', Z n+ i = k'} fl B. It will suffice to show, for all hypergraphs A' having m! 
patches and amount of debris k', that 

F[A n+1 = \',B']<xp(\'\S',m',k'), 

where oc denotes equality up to a constant independent of A'. But 

P[A n+1 = A', B'\ = V k -— ^P[A„ = A, B] oc Vp(A|5, m, fc). 
z — / m ^ — / 

A A 
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where the sum is over all hypergraphs A which collapse to A' on removing the vertex x. Let 
Y s , Z s denote the number of patches, amount of debris in A s respectively. Since Y s and 
Z s are conditionally independent of A s given Y s and Z s , 

J2p(MS,m,k) =F{A S ' = X'\Y S = m,Z s = k) oc p(X'\S',m', k'), 
x 

as desired. □ 



4 Exponential Martingales for Jump Processes 

We recall here some standard notions for pure jump Markov processes in W 1 and their 
associated martingales. These will be used to study the fluid limit of a sequence of such 
jump processes in Sectional 

4.1 Laplace Transforms 

Let (X t )t>o be a pure jump Markov process taking values in a subset / of M. d , with Levy 
kernel K. Consider the Laplace transform 

m(x,6)= [ e {9 ' y) K(x,dy), 9 E (R d )* 

and assume that, for some t]q > 0, 

sup sup m(x, 6) < C < oo (3) 
xei \e\<rio 

The distribution of the time T and displacement AXt of the first jump of (X t )t>o is given 
by 

P(T G dt, AX T edy\T>t, X = x) = K(x, dy)dt. 
Introduce random measures fi and v on (0, oo) x M d , given by 

v(dt, dy) = K{X t -, dy)dt 

where £(t, y ) denotes the unit mass at (t, y); v is thus the compensator of the random measure 
/i, in the sense of jTUj, p. 422. 
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4.2 Martingales Associated with Jump Processes 

The fact that v is a compensator implies that, for any previsible process a : Q x (0, oo) x 
R d -> R satisfying 



E / \a(s, y) \u(ds, dy) < oo, 



the following process is a martingale 

/ / a(s,y)(fj,- u)(ds,dy). 
Jo Jm d 

In particular, (jnj) allows us to take a(s, = y, which gives the martingale 

M t = y(n- v)(ds,dy). 

JO JR d 

Fix 7] G (0,?7o). Then there exists A < oo such that 

\m"{x,9)\<A, xel, \9\<r], (4) 
where ' denotes differentation in 9. Define for 9 G 

<Kx,9)= [ {e^-l-(9,y)}K(x,dy). 

Then <fi > and, for \9\ < rj, by the second-order mean value theorem, 

<p(x,9) = [ m"(x,r9)(9,9)(l-r)dr 



so 

<P(x,9)<±A\9\ 2 , xel, \9\<rj. 
Let (9 t ) t > be a previsible process in (IR d )* with |6> t | < r\ for all t. Set 

Z t = Z\ = exp{ f (9 S , dM s ) - [ <P(X S , 9 s )ds}. (5) 
Jo Jo 

Then (Z t )t>o is locally bounded, and by the Doleans formula ([E], P- 440), 

Z t = l+ I I Z 8 4e< «ri -l)(ji-v)(ds,dy). 

Jo JR d 
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Hence (Z t )t>o is a non-negative local martingale, so E(Z<) < 1 for all t. Hence 

E f [ \Z s 4e^ -l)\u(ds,dy) 

Jo JM d 



<E / Z s (m{X s ,9 s ) + m(X s ,0))ds < 2Ct 



o 



so (Z t )t>o is a martingale. 
Proposition 4.1. For all 5 G (0,y4?]t\/^] 



P sup \M S \ > 5 < (2d)e 



3 -<5 2 /(2Adt) 



Proof. Fix # G (R )* with \9\ = 1 and consider the stopping time 

T = inf{t > : (9,M t ) > 5}. 

For e < i], taking 9 t = 9 for all t above, we know that (Zf) t >® is a martingale. On the set 
{T < t] we have Zf? > e 5e_ylte / 2 . By optional stopping 

E«J = E(Zf) = 1. 

Hence, 

P (^sup (0, M s ) > d^j = P(T < t) < e - fc+Ate2 / 2 . 
When 5 < Atrj we can take e = 6/ At to obtain 

P(sup(0,M s ) > 5) < e- &2/2At . 

s<t 

Finally, if sup s<t \M S \ > 5, then sup s<t (9, M s ) > S/Vd for one of 9 = ±ei, . . . , ie^. □ 

5 Fluid Limit for Stopped Processes 

In this section we develop some general criteria for the convergence of a sequence of Markov 
chains in M. d to the solution of a differential equation, paying particular attention to the 
case where the chain may stop abruptly on leaving a given open set. 
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5.1 Fluid Limits 

It is possible to give criteria for the convergence of Markov processes in terms of the limiting 
behaviour of their infinitesimal characteristics. This is a powerful technique which has been 
intensively studied by probabilists. The book of Ethier and Kurtz is a key reference. 
Further results are given in Chapter 17 of ^U] and in Theorems IX. 4. 21 and IX. 4. 26 of [§]. 
A particular case with many applications is where the limiting process is deterministic and 
is given by a differential equation, sometimes called a fluid limit. The relevant probabilistic 
literature, though well developed, may not be readily accessible to non-specialists seeking 
to apply the results in other fields. One field where fluid limits of Markov processes are 
beginning to find interesting applications is random combinatorics. Wormald ^B] and co- 
workers have put forward a set of criteria which is specially adapted to this application. 
The material in this section may be considered as an alternative framework, somewhat 
more rigid but, we hope, easy to use, developed with the same applications in mind. 

Let (X^) t >o be a sequence of pure jump Markov processes in R d . It may be that 
(X t JV ) t > takes values in some discrete subset I N of R d and that its Levy kernel K N (x, dy) 
is given naturally only for x G I N . So let us suppose that I is measurable, that (X^) t > 
takes values in I N , and that the Levy kernel K N (x,dy) is given for x G I N . Let S be 
an open set in M. d and set S N = I N fl S. We shall study, under certain hypotheses, the 
limiting behaviour of (X^)t>o as N —* oo, on compact time intervals, up to the first time 
the process leaves S. In applications, the set S will be chosen as the intersection of two 
open sets H and U. Our sequence of processes may all stop abruptly on leaving some open 
set H, so that K N (x, dy) = for x H. If this sort of behaviour does not occur, we simply 
take H = W d . We choose U so that the conjectured fluid limit path does not leave U in 
the relevant compact time interval. Subject to this restriction we are free to take U as 
small as we like to facilitate the checking of convergence and regularity conditions, which 
are required only on S. 

The scope of our study is motivated by the particular model which occupies the remain- 
der of this paper: so we are willing to impose a relatively strong, large deviations-type, 
hypothesis on the Levy kernels K N , see (0) below, and we are interested to find that strong 
conclusions may be drawn using rather direct arguments. On the other hand, in certain 
cases of our model, the fluid limit path grazes the boundary of the set S: this calls for a 
refinement of the usual fluid limit results to determine the limiting distribution of the exit 
time. 
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5.2 Assumptions 

Consider the Laplace transform 

m N (x, 9) = / e {e ' y) K N (x, dy), x G S N , 6 G (R d )* 



We assume that, for some r/o > 0, 

m N (x,N6) 

sup sup sup — < oo. (6) 

n x &s N \9\<no N 

Set b (x) = m (x,0), where ' denotes the derivative in 9. We assume that, for some 
Lipschitz vector field b on S, 

sup \b N (x)-b(x)\^0. (7) 

We write b for some Lipschitz vector field on M. d extending b. (Such a b is given, for example, 
by b{x) = swp{b(y) — K\x — y\ : y G S} where K is the Lipschitz constant for b.) Fix a point 
xo in the closure S of S and denote by (x t ) t >o the unique solution to x t = b(x t ) starting 
from x . We assume finally that, for all 5 > 0, 

limsupJV -1 logP(|X^-a:o| > 5) < 0. (8) 

Whilst these are not the weakest conditions for the fluid limit, they are readily verified in 
many examples of interest. In particular we will be able to verify them for the Markov 
chains associated with hypergraph collapse in Section 3. 



5.3 Exponential convergence to the fluid limit 

Fix t > and set 

T N = inf{t > : X t N £ S} At . 
Proposition 5.1. Under assumptions (G|) ; (0), we have, for all 5 > 0, 

lim sup iV~ 1 logP( sup \X? - x t \ > 5) < 0. (9) 

N^oo t<T N 

Proof. The following argument is widely known but we have not found a convenient refer- 
ence. Set b N (x) = m N '(x,0) and define (M t Ar ) t > by 

X? = X? + M* r + fb N (Xf)ds. 
Jo 
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Note that M t corresponds to the martingale we identified in Proposition 14.11 Fix 77 G 
(0,770). Assumption (JHJ) implies that there exists C < 00 such that, for all N, 

\m N "{x, 6) I < C/N, x G S N , \9\ < Ntj. 

Compare this estimate with (J4"j). By applying Proposition 14.11 to the stopped process 
(X^ TN )t>o, we find constants Eo > and Cq < 00, depending only on C,r],d and to 
such that, for all N and all s G (0, eo], 

P( sup |Mf| > e) < Coe-^ 2 / 00 . (10) 

t<T N 

Given 5 > 0, set e = min{e~ Kt °5/3, Eq}, where K is the Lipschitz constant of b. Let 

n N = {{X^ -x \<e and sup \M t N \ < e}. 

t<T N 

Then (JHJ) and (JTUJ) together imply that 

limsup^ 1 logP(fi\fi iV ) < 0. 

On the other hand, by (J7J), there exists iVo such that \b N (x) — b(x)\ < s/to for all x G S N 
and all N > N . We note that 

X? -x t = (X N - x ) + M t N + f\b N (X») - b{X?))ds 

Jo 

+ / (b(X?) - b(x s ))ds 
Jo 

so, for N > N , on Q N , for t < T N , 

\X? -x t \<3£ + K [ \Xf - x s \ds 
Jo 

which implies, by Gronwall's lemma, that sup i<T iv — x t \ < 6. □ 
5.4 Limiting Distribution of the Exit Time 

The remainder of this section is concerned with the question, left open by Proposition 15.11 
of determining the limiting distribution of T N . Set 

r = inf{t > : x t i S} At , 
T = {tE[0,T):x t £S}. 
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It is straightforward to deduce from (jHJ) that, for all 5 > 0, 



limsupiVMogP ( inf \T N - 1\ > 8 J < 0. 



(11) 



In particular, if T is empty, then T — > r in probability and, for all 5 > 0, 

limsupiV" 1 logP(sup \X? - x tAr \ > 5) < 0. 

N^oo t<t 

The reader who wishes only to know the proof of Theorem 12.11 may skip to Section as 
the remaining results of this section are needed only for the more general case considered 
in Theorem r 



5.5 Fluctuations 

We assume here that 

T is finite. (12) 

In this case the limiting distribution of T N may be obtained from that of the fluctuations 
7^ = y/N (X^ TN — x iAT iv). We assume that there exists a limit kernel K(x,dy), defined 
for x G S such that m(x, 6) < oo for all x G S and \9\ < i] , where 



mix, 



I e {e ' y) K(x,dy), xeS, 9 



For convergence of the fluctuations we assume 

7 — > 7o in distribution, 



(13) 



sup sup 

xes N \e\<T]o 



m N (x,N9) 



N 



m(x, 9) 



where b N (x 



sup VN\b N (x) -b(x)\ -> 0, 

xes N 

a is Lipschitz and b is C 1 on S, 
m (x, 0) and a(x) = m"(x, 0). Of course ()14j) will force b(x) 



(14) 
(15) 
(16) 



m'(x, 0). 



Page 17 of E2I 



17 



Darling & Norris 



Random Hypergraphs 



5.6 Limiting Stochastic Differential Equation 

Consider the process (jt)t<T given by the linear stochastic differential equation 

d lt = cr(xt)dB t + Vb(x t )j t dt (17) 

and starting from 70, where B is a Brownian motion and o~(x)a(x)* = a(x). The distribu- 
tion of (jt)t<T does not depend on the choice of a. For convergence of T N we assume, in 
addition, 

dS is C 1 at Xt with inward normal n t , and P((n t ,7 t ) = 0) = 0, for all t ET. (18) 

Theorem 5.1. Under assumptions (0), J22J), W, 02P> W, W, we have T N -> T 
m distribution, where 

T = mm{t G T : (n t , 7*) < 0} A r. 

Proof. Let To = and write the positive elements of T as ri < • ■ ■ < r m . Define, for 
= 0, 1, . . . , m, 



l N k 



ifT N >r k 
otherwise, 



if T > r k 
otherwise, 



Ik = 

where d is some cemetery state. We will show by induction, for k — 0, 1, . . . , m, that 

(7o Y > • • • ,7k) (7o, ■■■,%) in distribution. (19) 

Given ffTTj) . this implies that T N — > T in distribution, as required. 

Note that both (7^)o<fc<m and (7fe)o<fc<m niay be considered as time- dependent Markov 
processes. Hence, by a conditioning argument, it suffices to deal with the case where 70 
is non-random. By (|18|). if x G SS", we can assume that dS is C 1 at x and (no,7o) 7^ 0. 
Moreover, for the inductive step, it suffices to consider the case where 7^ is non-random, 
not d, and to show that, if 7^ — > 7^ in probability, then 7^ — > 7 fe+1 in distribution. We 
lose no generality in considering only the case k = 0. 

We have assumed that 7^ — > 70 in distribution. Note that T = if and only if x e dS 
and (no,7o) < 0. On the other hand, since X$ = x + VN'yQ , we have F(T N = 0) — > 1 if 
and only if x G dS and (no, 70) < 0. Hence 7^ — > 70 in distribution, that is, (|T5|) holds 
for k = 0. 

In Lemmas 15. 4\ 15.51 and 15.61 below, we will show that, if xq G S, or xq G dS and 
(no, 7o) > 0, then 

F{T N > e) -> 1 for some £ > 0, 
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and, in the case m > 1, 

7^ — > j n in distribution, 
P((n T1 , 7 ^} >0and T N < n ) 0, 
P((n T1 ,7^) < and T N > t x ) -> 0. 

It follows that 7^ — > 71 in distribution, so (fTT?|) holds for A; = 1. This establishes the 
induction and completes the proof. □ 

We remark that the same proof applies when the Levy kernels K have a measur- 
able dependence on the time parameter t, subject to obvious modifications and to each 
hypothesis holding uniformly in t < t . 

For the remainder of this section, the assumptions of Theorem 15.11 are in force and 70 
is non-random. 

Lemma 5.2. For all e > there exists A < 00 such that, for all N 

P(sup | T f I > A) < 6. 

t<t 

Proof. Given e > 0, choose A < 00 and Nq such that, for A' = e~ Kt °X/3 and N > Nq 

VN\b N (x) - b(x)\ < X'/to, x e S N 
and, with probability exceeding 1 — e, 

iTo^l < A', 
Vn sup \M t N \< A'. 

t<T N 

This is possible by ()10jl and fj!5jl . These three inequalities imply 

bf| <3V + Jf /" |7f|da, t<T 
Jo 



N 



so, by Gronwall's lemma 

sup | 7 f I < A. □ 

t<T N 

□ 

Lemma 5.3. For all e > there exists A < 00 sitc/i £/iat ; for all 5 > 0, there exists N$ < 00 
such that, for all N > N$ and all t <t , 

P ( sup | 7 f - 7 f I > Av 7 ^ ) < £ 

\s<to,t<s<t+<5 / 
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Proof. Consider first the case t — 0. Given e > 0, choose A < oo such that, for all 5 > 0, 
there exists N s such that, for A' = e Kt °X/3 and N > N s 

VN\b N (x) - b(x)\ < X'/VF , x G S N , 
and, with probability exceeding 1 — £, 

|7?l < A'/^v^, 



iV sup |Mf| < A'v 7 ^. 

i^T^ A(5 

This is possible by (jl(Jj) and (|15|) . These three inequalities imply 



\^-^\<3\'V5 + K [ 

Jo 

so by Gronwall's lemma 



V-To^, t<T N A6, 



sup | 7 f- 7o ^|<AV5. 

t<T N A& 

The case t > follows by the same sort of argument, using Lemma 15^1 to get the necessary 
tightness of 7^. □ 

Lemma 5.4. Suppose either x G S, or x G OS and (n , 70) > 0. T/ien P(T Ar > e) — > 1 
as N 00 for some e > 0. 

Proof. The case xo G S 1 follows from ((TTJ) . Suppose then that xo G <9>S and (no, 70) > 0. 
Then, since is C 1 at xo, for all e > 0, there exists (5(e) > such that, for all x G S with 
k — x | < 5(e), and all t> G M d , 

|f I < 5(e) and (n 0) f ) > s|f | =^ a; + u G S. (20) 

Since (n ,7o) > 0, by Lemma EHA given e > there exist £1 > and AT such that, for all 
N > N and t < T N A £1, 

K,7f>>^i|7f|, l7fl<lM, 

with probability exceeding 1 — £. Choose £2 G (0,£i) so that |x t — Xq\ < <5(£i) and x t E S 
whenever t < e 2 . Set N x = max{A^ , (^t^i)) -2 }, then, for N > Ni and t < T N A £ 2 , 



e S, \x t -x \<5(e 1 ), N~ 1 ^\<5(e 1 ), (n 0)7 f ) > eibfl, (21) 



with probability exceeding 1 — e. By (I20J) . ()21)1 implies = x t + iV 1 ' 2 7^ v G 5. Hence 
P(T 7V < £ 2 ) < £ for all N>N!. □ 
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For the rest of this section we assume that m > 1. (The next result holds with n 
replaced r when m = 0, by the same argument, but we do not need this.) 

Lemma 5.5. Suppose either x e 5, or xo G (95* and (no, 70) > 0. TTien 7^ — > j n in 
distribution as N —>■ 00. 

Proof. By Lemma f5. 31 given 5 > 0, we can find t < T\ such that, for all N, 

P(l7f - 7? I > < 5, P(| 7 * - 7n I >S)<5. 

Hence it suffices to show 7^ — > j t in distribution for all t < T\. 
Define (ip t )t<r in R d ® (M d )* by 

■0 t = Vb(x t )ip t , ipo = id- 
Fix 9 e (R d )* and set t = {ijj* t )~ 1 9. Then 

d{6 ult ) = (6 t ,a(xt)dB t ), t < r, 

so ^ 

(0 t)Tt ) ~ iV«0,7o>, / (9 s ,a(x s )9 s )ds), t < r. 
Jo 

On the other hand, for (M^) t > as in the proof of Proposition 15.11 

N\ . /~\t 1 n jj|yrAT\ , r>N,8i. . ^ rpN 



d(9 t , It) = VN(9 t , dM?) + R?'°dt, t < T 
where 

R?> e = y/N(9 t , b N (X t N ) - b(x t ) - Vb(x t )(X t N - x t )). 

By (USD, 

sup ^/N\b N (X t N )-b(X t N )\^0. 

t<T N 

By (|TBjl. given e > 0, there exists 5 > such that, for all t 6 [e, tj — e], for |x — x t | < 5, 

\b(x) — b(x t ) — Vb(x t ){x — x t )\ < s\x — x t \. 
Hence \X^ — xt\ < 5 and e < t < T\ — e imply 

yfN\b{X») - b(x t ) - Vb(x t )(X? - x t )\ < £ |7f|. 
Combining this with Lemma we deduce that 

Tl 

\R^' e \dt ^ in probability. 
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Hence it suffices to show, for all 9 G and all t < n, 

VN [ (6 S , dMf) -»• JV(0, / (0„ a(> s )0 s )ds) in distribution. 
Jo Jo 

Indeed, it suffices to show, for all 6* G (M. d )* and t < ri, that K(E^' e ) —>■ 1 as iV — > oo, 
where t ^ 

E?' = exp{zVN [ (e a ,dM?) + l [ (9 s ,a(x s )9 s )ds}. 
Jo * Jo 

Set m N (x, 9) = m N (x, i9), rh(x, 9) = m(x, i9) and 

#"(M) = / (e«-l-i(0,y»^,dy). 

By fjl4|) . for all r] < r] , we have 

sup sup |JVm""(a:, iV0) - m"(z, 0)| -»• 0. 
zeS^ |0|<tj 

Note that 

I 



(f> N (x,VN9) + ^(9,a(x)9) 



i 

'Nfh m {x, VNr9) - m"{x, 0) ) (0, 0)(1 - r)dr 



o 



so, for all p < oo, 

sup sup |0 (sc , + -(0,a(x)0)| 0. (22) 



Write E^ e = E? = Z? A? B? , where 

Z? = exp{zv / iV f\e a ,dM?)- f 4> N (X?,VN9 s )ds}, 
Jo Jo 

A? = exp{j\t N (X», VN9 S ) + 1 -{9 S , a(X?)9 s ))ds}, 

B? = exp{jT ^<0 S) (a(ar,) - a(Xf ))0 s >ds}. 

Now (Z^ TN )t< r is a martingale, as in (J3J), so ~K(Z^ ATN ) = 1 for all N. Fix t < r. By 
^t/\T N * s bounded, uniformly in iV, and ^4^ T iv — > 1 uniformly as iV — > oo. Moreover, 
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by (JTBj). B^ ATN is bounded uniformly in N and converges to 1 in probability, using 0. 
Hence 

E(Z tATN A tATN B tATN ) — ► 1 

as JV -»• oo. By Lemma IO and (fTTl) . P(T JV > t) -»• 1 for all t < n. It follows that 
E,(E^) — > 1 for all £ < n as required. □ 

Lemma 5.6. Suppose either xq G S, or xq G dS and (no,7o) > 0. Then, as N oo, 

P((n n ,7^) > andT^ < n) -> 0, 
P((n n)7 5 < and T N > n) -> 0. 

Proof. By Lemma f5. 51 given e > 0, there exists ej > and iVo such that, for all N > N 

\(n T1 ,^ 1 )\>s 1 \^ 1 \, s 1 <\^ l \<l/s 1 , 

with probability exceeding 1 — e. Then by Lemma 15.31 there exists £ 2 > and N\ > No 
such that, for all N > N\, with probability exceeding 1 — e, either 

(n ri , 7 f}>£ 2 | 7 f|, |7fl<Vea for aU t G [n -e 2 ,n] (23) 

or 

K,7^)<-^|7^l, |7?l<l/e 2 . (24) 
Since dS* is C 1 at x T1 , there exists 5 > such that 

if x G 5 and v EM. d with |x — x Tl \ < 5, \v\ < 5 and (n Tl ,v) < ez\v\, then x + v E S 

and 

if t> G R d with \v\ < 5 and (n T1 , v) < — £ 2 |t>|, then x T1 + f ^ S 1 . 

Choose £3 G (0,e 2 ] such that \x t — x n \ < 5 and x t G S whenever t G [t% — £3,71]. Set 
7V 2 = max^, (£ 2 5)- 2 }. Then, for N > N 2 , since X? = x t + N' 1 ' 2 ^ on {T^ > *}, 
© implies G 5 for all t G [n - £ 2 ,ti] or T N < n - £ 2 , and (J21 implies £ 5 
or < n. We know by Lemma O and (PJ that P(T iV < n - e 2 ) -» as N -> 00. 
Hence, with high probability, as X — > 00, (n T1 ,7^) > implies ([23|) and then T N > Ti, 
and (n n , 7^) < implies )22) and then T N < t x . □ 

6 Fluid Limit of Collapsing Hypergraphs 

We now apply the general theory from the preceding sections to prove our main results 
Theorems 12.11 and 12.21 
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6.1 Levy Kernel for Collapse of Random Hypergraphs 

In Section El we introduced a Markov process (A n ) n > of collapsing hypergraphs, starting 
from A ~ Poisson(/5) and stopping when n = \V*\, the number of identifiable vertices in 
A . The process (Y n , Z n ) n > of patches and debris in A n was found itself to be Markov. We 
now view this process as a function of the initial number of vertices N and obtain a fluid 
limit result when N — > oo. 

It will be convenient to embed our process in continuous time, by removing vertices 
according to a Poisson process {vt)t>o of rate N which stops when v t = \V*\. Set 

Xf = N~\v u Y Vt ,Z Vt ) 

and note that X N takes values in 

I N = {x <EM. 3 : Nx 1 G {0,1, . . . , N — 1}, Nx 2 , Nx 3 G Z+} U {(1, 0, x 3 ) : Nx 3 G Z+}. (25) 

The Levy kernel K N (x,dy) for (X i iY ) 4 > is naturally defined for x G I N . If x 2 = then 
K N {x, dy) = 0. If x 2 > 0, then iV" 1 ^ X, *J is db probability measure; by Lemma f3. 11 it is 
the law of the random variable J N /N, where 

J N = (1,-1-W N + U N ,1 + W N ), 

W N ~ B(Nx 2 - 1, 1/(A^-A^a; 1 )), U N ~ P((N - Nx 1 - 1)X 2 (N, Nx 1 )) 

with W N and U independent. 

Recall that R denotes the radius of convergence of the power series (3{t), given by (JTJ). 
We assume, until further notice, that R > and fix to G (0, R A 1) and p G (to, RAl). 

Lemma 6.1. There is a constant C < oo such that 

\NX 2 {N,n) - (3"{n/N)\ < C{logN) 2 /N 

for allN G N and n G {0, 1, . . . , [Np]}. 

Proof. Recall that 

n(n — 1) . . . (n — i + 1) 



\ 2 (N,n) = Nj2( i + 1 )( i + 2 )& 



i+2 



i=0 



N(N - 1) . . . (N - % - \) 



Set M = AlogN where A = (log^R/p))' 1 < oo. Then, for n < [Np], 
\N\ 2 (N,n)-P"(n/N)\ 

M An oo 

<j2(i+m+m + A(N,n)+2 j2 {i+m+^^p^N,^ 

i=l i=M+l 
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where 



Si{N,n) 



N 2 n f n - 1 \ ( n-i+1 

{N - i){N - i - 1)N \N-1 )"' \N-i + l 



and 



Pi {N,n) 



N 2 



n 



(N -i)(N -i- 1) \N 
Note that, for j — 0, . . . , i — 1 and i < M A n, 



Tr) < C P l - 



n — j n 



< A log N/N 



N-j N 

so, making use of the inequality | Yi a j ~ Yl bj\ < Yl \ a j ~ bj\ for < a,j, b 3 < 1, we obtain 

<5;(iV,n) < C(logiV) 2 p7iV. 

Hence 



|JVA 2 (iV,n)-/3"(n/iV)| < C (log N) 2 /3" (p)/N + C(p/R) 



and (p/R) M = l/N 



6.2 Fluid Limit 



□ 



The main result of this section is to obtain the limiting behaviour of (X t ) t >o as iV — > oo, 
which we deduce from Proposition 15. II and Theorem 15.11 We present first the calculations 
by which the limit was discovered. 

Note that, as N — > oo, for x 1 < R A 1, we have W N —>■ W and U N — > U in distribution, 
where 

W ~ P(x 2 /(1 - x 1 )), [/ ~ P((l - x 1 )/?"^ 1 )). 

Set J = (1,-1-W+17, 1+W). Note also that -> x = (0,/?i,/3b) and y/N^-xo) -> 
7o in distribution, where 7q = 0, 7 2 ~ iV(0, 7q ~ -^(0, /3o), with 7 2 and 7q independent. 
Thus, subject to certain technical conditions, to be checked later, at least up to the first 
time that X^ 1 > R A 1 or X^' 2 = 0, the limit path is given by it = b(xt), starting from 
xq, where 

b(x) = E(J) = ( 1, -1 - + (1 - x'Wix 1 ), 

\ 1 — x 1 1 — X 1 J 

Fix p' G (0, oo) and set 



S = {(x\x 2 ,x 3 ) : (x 1 ! < p, x 2 e{0,p), x 3 eR} 



(26) 
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then b is Lipschitz on S and, for p' sufficiently large, the maximal solution on [0, to] to 
x t = b(x t ) in S starting from xo is given by (x t )t< T , where 

xt = (t, (f - t){(3\t) + log(l - t)), /3(t) - (1 - 1) log(l - t)) 

and 



t = z* At, 



o- 



6.3 Limiting Fluctuations 

Set a(x) = E(J® J). A convenient choice of a such that aa* = a is a = (Vf, V2, V3), where 



V &) = \ 7~~~ — T 1 h ^(x) = v/(l - x 1 )/?"^) 1 , F 3 (x) = 6(x) 




Note that a is a Lipschitz and b is C 1 on 5*. The limiting fluctuations are given by 

dj t = v A x t)dB\ + Vb{x t )j t dt, t < r 

i 

starting from 70, where B is a Brownian motion in R 3 independent of 70. Note that 

T = {te[0,T):x t <£S} = (n[0,t o ). 

In cases where T is non-empty, the limiting behaviour of (Xf)t<t depends on the signs of 
the component of the fluctuations normal to the boundary, that is, on (7I : t G T). 
Note that 9 t = b(x t )Bf satisfies 

d6 t = V 3 (x t )dB'f + Vb(x t )6 t dt. 

This is the part of the fluctuations which reflects our Poissonization of the time-scale. Since 
b 2 (x t ) = for all t G T, it does not affect (j 2 : t G T). So consider 7 4 * = 7* — 0$. Then 

d 7i * = ^(a^dB* + V 2 (x t )dB 2 + Vb(x t )tfdt. 

Note that ^(ar) = ^(x) = and Vb 1 (x) = 0, so (7;) 1 = (7*) 1 = for all t. Also 
Ob 2 /dx 2 = -1/(1 - x 1 ) and «% 2 /<9x 3 = 0. Also x\ = t and x 2 /(l - x\) = (3'{t) + log(l - t). 
The sign of (7 t *) 2 is the same as that of a t = (7 4 *) 2 / (1 — £). We have 

da t = d7 t * 2 /(l - t) + 7 t * 2 /(l - t) 2 rft 
= [V 2 {x t )dB] + V?(z t )dS t 2 )/(l - t) 
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so we can write at = W(af), where W is a Brownian motion and 

2 f w +iog(i-s) + (i-sy M(js 

Jo 1 - s 

_ /?'(*) + log(l-t)+t 

1-t 

We have shown that (sgn (7^) : t G T) has the same distribution as (sgn(W / t /(i_ t )) : t G T). 
In particular P(7 2 = 0) = for all t G T. 
Recall that Z is defined by 

Z = mm{z eC,-W (z/1 - z) < 0} A z*. 

Set 

T w = inf{t > : X? ' 2 = 0} 
and put Z(t ) = Z A t Q , T N {t ) = T N A *„• 

Theorem 6.1. For all 5 > we /iave 

lim sup iV _1 logP ( sup \X* T -x t \>6] < 0. 

iV-*oo \t<T N (t ) J 

Moreover, T N (to) — > Z(to) in distribution as N — > 00. 

Proof. We defined I N , the state-space of (Xf ) t >„, in (|1>D, and 5 in flUJ). Set = I N f)S. 
For a; G 5 we have 

m N (x, 6) = [ e^K N (x, dy) = NE ( e ^/ N ) 

so 

m N (x,6) f „ n n n / 2 1 1 #3 - #2 



+ P f(iV - iVx 1 - l)A 2 (iV, Nx 1 ), I J |. 

where, for 6» G M, we write B(N,p,0) = Mog(l -p + pe e ) and P(A,0) = A(e e - 1). So, by 
Lemma 6.1, 

m N {x,N6) 



sup sup 



A' 



— m(:r, 1 
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as N — > oo, for all r] > 0, where 



mix. 



0) = E(e< e ' J >) = exp{0! - £ 2 + 6 3 + P (^T- ^3 - O^j + P ((1 - ^"(x 1 ), 2 )}. 



Set 

&"(x) = ! yK N (x,dy) = E(J N ) 
Jm. 3 

then, by Lemma 7.1, 

sup V^V \b N (x) -b(x)\ ->0. 

Recall that 

^ = 0, NX™ ~ PiNPi), NX™~ P (Nf3 ) 
and xo = (0, j3\,j3o). By standard exponential estimates, for all 5 > 

limsup N' 1 \og¥(\X* - x | > 5) < 0. 

AT->oo 

We have now checked the validity of (f7jl. Q. (fT2|l . (fTHjl . (fTljl . (fTHj) . (fTfij) . (fTSjl in this context, 
so Proposition 15.11 and Theorem 15.11 apply to give the desired conclusions. □ 

Remark 6.2. If z* < 1, then z* < R A 1, so by choosing to G (z*,R A 1) we get -Z^o) = 
and, as iV — > oo, with high probability T N (to) = T N . Hence, when z* < 1, Theorem 16.11 
holds with Z and T N replacing Z(to) and T N (to). In particular, Theorem 12 . II follows. 

6.4 Proof of Theorem \Z72\ 

Proof. Recall that 

"|yJV*| lA^*! 



X^n — ( - — 



N N 

Let z G C U {z*}. If 2 < 1, then z < R A 1 so, by choosing t Q &(z, RAl) in Theorem 15. 1| we 
obtain 

F\\X$ N -x z \<6j ^F(Z = z) (27) 

for all sufficiently small 5 > 

It remains to deal with the case z = z* = 1. Note that (V^*! < iV and \A N *\ < \A N \. 
Now \A N \ ~ P(N(3(1)) so lA^I/iV -> /3(1) in probability as iV -> oo. It therefore suffices 
to show, for all 5 > and a < (3(1) — 5, 

/\V N *\ \A N *\ \ 
liminf PfJ L > 1 - 5 and J ^ > a > P(Z = 1). 

JV->oo \ N ~ N ~ J ~ ' 
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When combined with (J27|) this completes the proof as we have exhausted the possible values 
of Z. 

We consider first the case R > 1. We can find to £ (1 — 5/2, 1) such that /3{t ) > a + 5/2. 
Note that \Xg - x to \ < 5/2 implies 

\V N *\/N > Xg' 1 > t - 5/2 > 1 - a, 

\A N *\/N > X^' 3 > /3{t ) - (1 - t ) log(l - to) - 5/2 > a. 

By Theorem 16.11 

liminf P( sup |Xf - x t \ < 5/2) > F(Z > t ) > F(Z = 1) 

N^oo \ t < tQ J 

so we are done. 

Consider next the case R = 0. Fix M G N and set /3j = (3j if j < M and /3j = 
otherwise. Then, with obvious notation, we can choose M so that z > 1 — 5/2, ( = and 
/3(2o) > a + 5/2. Hence 

if/- N* I IA^*I 
F'l > 1 - 5 and > o 1 — I . 

We can couple A and A so that A(A) = A(A)1 ]A{ < M . Then V N * C y N * and A^* < A^*, 
so this is enough. 

There remains the case R G (0, 1). In this case ( is finite. We have assumed that R ^ £. 
So we can find p G (sup £, i?) and MgN such that, with obvious notation, 

zq>1- 5/2, C = C, /3(5o) > a + 5/2 

where (3{t),t G [0, 1), is defined by 



0(0) = ft, /3'(0)=A, /3"(t) 



/3"(t), t < p, 



Consider the collapsing hypergraph (A^) n > which evolves as (A^)„> up to n = u(p), 
at which time all hyperedges having at least two vertices and originally having more than 
M vertices are removed, so that A^ < A^. After u(p), (A,^) n > evolves by selection of 

patches as before. Denote by V^* the set of identifiable vertices in A^ % and by A^* the 
corresponding identifiable hypergraph. Then 

V N * C V N * and A^* < A N *. 
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A modification of Theorem 15.11 shows that 



F[\X»-x~ Z0 \<5/2) - F(Z = z )=F(Z = l) 



with Xf = flV^-l/JV.O, lA^I/JV) and with 

x t = (t, (1 - t)0(t) + log(l - t)), /3(t) - (1 - t) log(l - *)). 

All that changes in the proof is that, for t > p the Levy kernel is modified by replacing A2 
by A 2 given by 

The argument of Lemma 6.1 shows that for all p' < 1 there is a constant C < 00 such 
that 

|ATA 2 (A^,n) -p"(n/N)\ < C/N 
for all N G N and n = {0, 1, . . . , [iVp']}. Everything else is the same. Now \X± — xg \ < 5/2 

implies 

IV^I/JV > \V N *\/N = X?' 1 >z -6/2>l-5, 
\A N *\/N > \~A N *\/N = xf 3 > fa) - (1 - So) log(l - z ) -5/2>a 

so 

|yiV*| | A 7V* 



liminf P - > 1-5 and 1 > a > P(Z = 1) 

\ AT - N ~ J ~ v 7 

as required. □ 

Acknowledgments. The authors thank the Mathematisches Forschungsinstitut 
Oberwolfach for the invitation to the meeting in 2000 at which this collaboration 
began, and Brad Lackey for his interest in this project. David Levin helped us 
to articulate some of the concepts presented herein. 



Page 30 of El 



30 



Darling & Norris 



Random Hypergraphs 



References 

Achlioptas, Dimitris, Lower bounds for random 3-SAT via differential equations. Phase 
transitions in combinatorial problems (Trieste, 1999). Theoret. Comput. Sci. 265 
(2001), no. 1-2, 159-185. MR1848217 

Achlioptas, Dimitris; Kirousis, Lefteris M.; Kranakis, Evangelos; Krizanc, Danny Rig- 
orous results for random (2 + p)-SAT Phase transitions in combinatorial problems 
(Trieste, 1999). Theoret. Comput. Sci. 265 (2001), no. 1-2, 109-129. MR1848214 

Bollobas, B., Random graphs. Academic Press, London, 1985. 

Coppersmith, Don; Gamarnik, David; Hajiaghayi, Mohammad; Sorkin, Gregory B. 
Random MAX SAT, random MAX CUT, and their phase transitions. Proceedings of 
the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms (Baltimore, 
MD, 2003), 364-373, ACM, New York, 2003. MR1974940 

Darling, R.W.R.; Levin, D.A.; Norris, J. R., Continuous and discontinuous phase tran- 
sitions in hypergraph processes. Preprint. http://xxx.lanl.gov/abs/math.PR/0312451 

Ethier, S. N.; Kurtz, T. K., Markov processes: characterization and convergence. 
Wiley Series in Probability and Mathematical Statistics. Wiley, New York, 1986. 

Duchet, Pierre, Hypergraphs. Handbook of Combinatorics, edited by R. Graham, M 
Grotschel, L. Lovasz, Elsevier Science B.V., 1995 

Erdos, P.; Renyi, A., On the evolution of random graphs. Magyar Tud. Akad. Mat. 
Kutato Int. KozL, 5, (1960), 17-61. 

Jacod, J.; Shiryaev, A. N., Limit Theorems for Stochastic Processes. Springer, Berlin, 
1987 

Kallenberg, O., Foundations of Modern Probability. Springer, New York, 1997. 

Karohski, M.; Luczak, T., Random hypergraphs. Combinatorics, Paul Erdos is eighty, 
Vol. 2 (Keszthely, 1993), 283-293, Bolyai Soc. Math. Stud., 2, Janos Bolyai Math. 
Soc, Budapest, 1996. [97m:05232] 

Kordecki, Wojciech, On the connectedness of random hypergraphs. Comment. Math. 
Prace Mat. 25 (1985), no. 2, 265-283. [87k:05140] 

Norris, J.R., Cluster coagulation. Commun. Math. Phys. 209 (2000), 407-435. 



Page 31 of E2 



31 



Darling & Norris 



Random Hypergraphs 



[14] Schmidt-Pruzan, Jeanette; Shamir, Eli, Component structure in the evolution of ran- 
dom hypergraphs. Combinatorica 5 (1985), no. 1, 81-94. [86j:05106] 

[15] Wilson, David B. On the critical exponents of random k-SAT. Random Structures and 
Algorithms 21 (2002), no. 2, 182-195. MR1917357 

[16] Wormald, Nicholas C, Differential equations for random processes and random graphs. 
Annals of Applied Probability 5 (1995), no. 4, 1217-1235. [97c:05139] 



Page 32 of E2 



32 



